Recent improvements in voicemail transcription
نویسندگان
چکیده
In this paper we report recent improvements in voicemail transcription. The voicemail transcription task was introduced last year [1] as representing a style of conversational telephone speech that is somewhat different from the Switchboard and CallHome [2] databases. Last year, the speaker independent and speaker adapted word error rates (WER) on this task were reported at 41.94% and 38.18% respectively, in [1]. This year, we report a relative improvement of 18% in the speaker independent performance and 11% in the speaker adapted performance over last year. This improvement is a result of some new algorithms and an increase in the amount of training data. In the following sections, we describe the contribution of several components to improving the word error rate.
منابع مشابه
Recent improvements in speech recognition performance on large vocabulary conversational speech (voicemail and switchboard)
In this paper we report recent improvements in word error performance on a voicemail transcription task. Last year, the speaker independent word error rate (WER) on the dev test set of the Voicemail Transcription task was reported at 35.45% [1]. This year, we report a relative 20% gain over this number. The improvements were obtained using several new algorithms and an increased amount of train...
متن کاملPerformance Improvements in Voicemail Transcription
In this paper we report recent improvements in word error performance on a voicemail transcription task. Last year, the speaker independent word error rate (WER) on the dev test set of the Voicemail Transcription task was reported at 35.45% [1]. This year, we report a relative 20% gain over this number. The improvements were obtained using several new algorithms and an increased amount of train...
متن کاملA study of adaptation techniques on a voicemail transcription task
Speaker adaptation techniques have emerged as very effective and practical methods to improve ASR performance on a test speaker with only limited speech data from the speaker. We explore the use of adaptation techniques on a new Voicemail database and present some theoretical extensions of the Cluster Transformation (CT) technique. Our experiments on 40 hours of voicemail data and four clusters...
متن کاملAutomatic speech recognition performance on a voicemail transcription task
In this paper, we report on the performance of automatic speech recognition (ASR) systems on voicemail transcription. Voicemail is spontaneous telephone speech recorded over a variety of channels; consequently, it is representative of many challenging problems in speech recognition. In the course of working on this task, several algorithms were developed that focus on different components of an...
متن کاملAutomatic transcription of voicemail at AT&T
This paper reports on the automatic transcription accuracy of voicemail messages. It shows that vocal tract length normalization and adaptation using linear transformations, proven to improve accuracy on the Switchboard task, provide similar accuracy improvements on this task. Direct application of the normalization techniques is complicated by the fragmentation of the data. However, unsupervis...
متن کامل